arXiv: 1505.07652v2 [cond-mat.stat-mech] 12 Oct 2015 
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We derive expressions for the dispersion for two classes of random variables in Markov processes. 
Random variables like current and activity pertain to the first class, which is composed by random 
variables that change whenever a jump in the stochastic trajectory occurs. The second class corre¬ 
sponds to the time the trajectory spends in a state (or cluster of states). While the expression for 
the first class follows straightforwardly from known results in the literature, we show that a similar 
formalism can be used to derive an expression for the second class. As an application, we use this 
formalism to analyze a cellular two-component network estimating an external ligand concentration. 
The uncertainty related to this external concentration is calculated by monitoring different random 
variables related to an internal protein. We show that, inter alia, monitoring the time spent in the 
phosphorylated state of the protein leads to a finite uncertainty only if there is dissipation, whereas 
the uncertainty obtained from the activity of the transitions of the internal protein can reach the 
Berg and Purcell limit even in equilibrium. 

PACS numbers: 02.50.Ey, 87.10.Vg, 05.70.Ln 


I. INTRODUCTION 

Markov processes are used to model a wide variety 
of physical, chemical and biological phenomena [ll-Q- 
While calculating the mean of a random variable is often 
relatively straightforward, obtaining its dispersion can be 
much harder. A prominent example is the expression for 
the dispersion of the number of steps of a particle hop¬ 
ping in a one-dimensional lattice Q. 

Fluctuating currents are important random variables 
in stochastic thermodynamics If their mean is 

nonzero in the steady state, the system is out equilib¬ 
rium. A standard method to calculate the dispersion of 
this random variable is to obtain the scaled cumulant 
generating function as the maximum eigenvalue of the 
so-called modified generator Q. However, determining 
this eigenvalue as a function of the transition rates is 
typically difficult, even for systems with a small number 
of states. 

The problem of calculating the dispersion of fluctuat¬ 
ing currents can be overcome with an elegant method 
introduced by Koza Q- With Koza’s method, instead 
of calculating the maximum eigenvalue, the dispersion 
is obtained from coefficients of the characteristic poly¬ 
nomial associated with the modified generator. For in¬ 
stance, this method is useful to prove a bound on the 
minimal number of states of a one-dimensional lattice in 
terms of this dispersion @ (see [1, [l^ for discussions on 
the importance of this bound). Recently, we have used 
this method to derive a bound on the minimal ener geti c 
cost of small uncertainty in biomolecular processes [TJ. 
Other references that use this method are (see 

also [T6l - [l^ for related methods that use perturbation 
theory). 

As we point out in the present paper, Koza’s method 
can be used to calculate the dispersion of any random 


variable that changes whenever a jump occurs during a 
stochastic trajectory. Besides fluctuating currents, an¬ 
other random variable pertaining to this class is the dy¬ 
namical activity, for which the total number of transi¬ 
tions is counted. This activity appears in fluctuation- 
dissipation relations out of equilibrium [T^ . and is 
studied in so-called dynamical phase transitions [2ll - [^ . 

We show that a similar method can be used to cal¬ 
culate the dispersion of a second class of random vari¬ 
ables. This class corresponds to variables counting the 
fraction of time a stochastic trajectory spends in a cluster 
of states. Our calculations lead to an expression for the 
dispersion of this variable in terms of the coefficients of 
the characteristic polynomial associated with a discrete¬ 
time modified generator. 

The time spent in a state is a key random variable in 
the problem of a cell estimating an external ligand con¬ 
centration by monitoring the time a receptor is in the 
bound state, i.e., occupied by an external ligand. The 
uncertainty on the estimated concentration sets a fun¬ 
damental limit on the cell’s ability to sense the external 
environment, as established in the seminal work of Berg 
and Purcell [2^ (see also [2^ - 1^ ). 

We analyze a four-state model for a two-component 
network that estimates an external concentration HH- 
[^ . The two-component network is composed by the 
receptor and an internal protein that can be phospho¬ 
rylated due to ATP consumption. This model provides 
an example for which we can calculate the dispersion for 
three different random variables: current and activity 
from the first class, and time spent in the phosphory¬ 
lated state from the second. Specifically, we compare the 
uncertainty of the estimated external concentration that 
is obtained from these three different random variables 
and discuss its relation with energy dissipation. 

The paper is organized as follows. The expression for 
the dispersion of the first class of random variables is 
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provided in Sec. m In Sec. imi we introduce a method 
to calculate the dispersion for the second class. Sec. HVl 
contains the application to the problem of sensing an 
external concentration. We conclude in Sec. 0 In Ap¬ 
pendices El and m we perform detailed derivations for 
the first and second classes of random variables, respec¬ 
tively. The calculations of the four-state Markov process 
considered in Sec. m are explained in Appendix ICl 


Furthermore, the dispersion is defined as 

,7) 

An expression for these quantities in terms of the tran¬ 
sition rates is obtained in the following way. The scaled 
cumulant generating function associated with Xi is given 
W the maximum eigenvalue of the modified generator 


II. DISPERSION FOR CURRENT AND 
ACTIVITY 


The transition rate from state i to state j of a generic 
continuous-time Markov process with a finite number of 
states N is denoted by Wij. The time evolution of the 
occupation probability of state i at time t is determined 
by the master equation 

= XI - PAT)w^:j]. (1) 


For the discussion below it is convenient to write this 
equation in the from 

Xp(r) = £P(r), (2) 

where P(t) is the probability vector with N components 
and C is the stochastic matrix with elements 



- T,k Wik 


j 

\ii = j 


( 3 ) 


Furthermore, the stationary probability of state i is writ¬ 
ten as Pi- Two key quantities in this paper are the (sta¬ 
tionary) average probability current from state i to j 


Jij — Pi'^ij (^) 

and the average activity 

^ij = Pi'^ij T Pj^ji' (3) 


This activity is the average number of transitions per 
time between the pair of states ij. 

Quite generally, we consider a random variable Xi that 
is a functional of the stochastic trajectory. This trajec¬ 
tory is a sequence of jumps during a time interval t, which 
is assumed to be long, i.e., the expressions below are valid 
in the formal limit t —>■ oo. The random variable Ai is 
then characterized by the increments Oij: it increases by 
Oij whenever a jump from i to j occurs. If we choose the 
increments 0ij = —Oji = I and zero for any other pair 
of states, we obtain {Xi)/t = Jij, where here and in the 
following the brackets denotes an average over stochastic 
trajectories in the stationary state. On the other hand, 
the choice 6ij = Oji = 1 results in {Xi)/t = Ay. The 
“velocity” associated with the generic variable Xi is 


Vi 


(^i) 


[£{z)]ij — 


Wji ex.p{z9ji) j 

- J2k if * = J 


( 8 ) 


The characteristic polynomial of this matrix reads 

N 

det {yl - Liz)) = '^Cr,{z)y'^, (9) 


n—0 


where Cniz) are coefficients of the polynomial and / is 
the identity matrix. It turns out that the velocity and dis¬ 
persion can be written in terms of these coefficients and 
its derivatives at z = 0. This procedure is very conve¬ 
nient since we can calculate the dispersion without know¬ 
ing the full scaled cumulant generating function, which 
is a root of this polynomial. The specific expressions are 


vi = - 


Cl’ 


( 10 ) 


and 


Di 


C(,' + 2C[vi + 2C2 vI 
2Ci 


( 11 ) 


where primes denote derivatives with respect to z and the 
lack of explicit z dependence in the coefhcients denotes 
evaluation at z = 0. These expressions are derived in 
appendix El They have been obtained by Koza Q for 
the random variable Xi being the current. We are simply 
pointing out here that they are also valid for any variable 
Xi with generic increments Oij, i.e., in particular also for 
the activity. 


III. DISPERSION FOR THE TIME SPENT IN A 
STATE 


The second class of random variables X 2 corresponds 
to the time the stochastic trajectory spends in a state or 
in a cluster of states. The first moment of this random 
variable is easy to determine. For example, the average 
time spent in state i is the stationary probability Pi mul¬ 
tiplied by the total time, i.e., {X 2 ) = PJ. Generally, we 
denote a cluster of states by F, with = 1 if state i G F 
and 5i^r = 0 otherwise. The “velocity” associated with 
X 2 is defined as 


V2 


(^ 2 ) 


y^^PA.r- 


t 


( 6 ) 


t 


( 12 ) 
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Furthermore, the dispersion reads 


D2 


{X^) - {X2? 

2t 


(13) 


We discretize time in order to calculate this dispersion. 
The discrete-time stochastic matrix is 


]C = I + eC, (14) 


where e is the size of the time-interval. Since we are in¬ 
terested in the continuous-time limit, at the end of the 
calculation the limit e —>■ 0 is taken. We define the mod¬ 
ified matrix 




ewji exp(2;(5r.i) if * ^ j 

(1 - eZ)/c'^*fc)exp(2(5r,i) if * = j 


( 15 ) 

The maximum eigenvalue of this modified matrix gener¬ 
ates the moments of a random variable that is the number 
of discrete-steps the system is in states pertaining to T 
during the time interval t [s^. Defining the coefficients 


N 

det {yl - IC{z)) = ^ Cn{z)y'^, 

n—0 


we obtain 


and 


V2 = 


c' 

A^n—0 


D, = - 


- lim /(e), 
2 £-*■0'^ ^ ^ 


(16) 


(17) 


(18) 


where 


Elo < + 2v2 ELi nc'„ + uj - l)c. 

En=l 'TlCn 




N 


/(e) = 


(19) 

These expressions for velocity and dispersion are derived 
in appendix m where we also show that Eqs. dUD and 
()19|) are in perfect agreement with numerical simulations. 
The expression for the dispersion of this second class of 
random variables is the main technical result of this pa¬ 
per. Given a continuous-time Markov process with a 
small number of states, small in the sense that the matrix 
in Eq. (HSl) can be handled analytically, one can readily 
obtain the dispersion D 2 as a function of the transition 
rates using Eqs. m and (HU). 


In the next section we calculate the dispersion from dif¬ 
ferent random variables and analyze their relation with 
energy dissipation. Related studies that analyze the rela¬ 
tion between the relative uncertainty associated with an¬ 
other random variable, the life-time of cluster of states, 
with energy dissipation are [13, for a biological timer, 
and for a receptor that dissipates chemical energy. 
In general, this lifetime of a cluster of states is different 
from the time an stochastic trajectory spends in a cluster 
of states. 



FIG. 1: (Color online) Two-component network. External 
ligands, represented by the black spheres, can bind to and 
unbind from the receptor siting on the cell membrane. The 
chemical reaction rates of the protein inside the cell Y depend 
on the occupancy of the receptor. 


IV. INFERENCE OF AN EXTERNAL LIGAND 
CONCENTRATION BY AN INTERNAL 
PROTEIN 


A. Uncertainty and inference time 


As an application, we consider a two-component net¬ 
work of a cell estimating an external concentration c rep¬ 
resented in Fig. [T] A review on two-component signaling 
networks is given in [40j |. In our model, transition rates 
can depend on c, and, therefore, the moments of a ran¬ 
dom variable X, pertaining to any of the two classes, will 
depend on c. The relative uncertainty in estimating c by 
monitoring X for a time t is written as 




((A^) - (A)2) 



-2 


2D = -T. 


( 20 ) 


Characteristically, the uncertainty decreases with the in¬ 
tegration time t. The prefactor T is the integration time 
required to get a relative uncertainty smaller than 100%. 
We shall refer to it as the inference time. Below we will 
calculate this inference time obtained from different ran¬ 
dom variables in a two-component network. The infer¬ 
ence time can also be expressed as the inverse of the 
signal-to-noise ratio normalized by the integration-time 
time multiplied b y t he relative change in the external 
concentration [dll, 42|. 


B. Berg and Purcell Limit 


The receptors sitting on the cell membrane constitute 
the first layer of the two-component sensory network. In 
the two-state model for a single receptor shown in Fig. [21 
the receptor can be either bound by a ligand or free. The 
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FIG. 2: (Color online) Two-state single receptor model. The 
bound state corresponds to the receptor with a ligand. The 
arrows indicate the transition rates from unbound to bound 
TjC^c/co and from bound to unbound ■ 


parameters determining the transition rates in Fig. [5] are 
the external concentration c, the concentration for which 
both states are equally probable cq, and the average life¬ 
time of the bound state The key feature of these 
transition rates is that the binding rate is proportional 
to the external ligand concentration c. 

If the concentration is estimated by monitoring the 
time spent in the bound state, the inference time is 

T2 = 2n^±^, ( 21 ) 

C 

which is the result obtained by Berg and Purcell [^ . 
The subscript in 72 indicates that the random variable is 
from the second class. 

If instead of monitoring the time spent in the bound 
state the activity, i.e., the total number of transitions is 
monitored, a different uncertainty is obtained. In [T7| . 
it has been shown that the inference time obtained from 
the activity is 

For low concentrations c <C cq, the uncertainty obtained 
by monitoring the activity is half of the Berg and Purcell 
uncertainty, i.e., 7i = 72/2. This same limit is achieved 
by estimating the concentration with a maximum likeli¬ 
hood method applied to the probability of a trajectory of 
the two-state model [l^. It is straightforward to obtain 
Eq. (1221) with the formalism from Sec. m and Eq. m 
with the formalism from Sec. uni in both cases one has to 
calculate the characteristic polynomial of a 2x2 matrix. 


C. Monitoring the internal protein 


The second layer of the two-component network is 
an internal protein which can be in states Y and Y*, 
where Y* is the phosphorylated form of the protein. The 
full two-component network then has four states [s^, as 
shown in Fig. [31 Similar models for two-component net¬ 
works consider a large number of internal proteins [Slj. 
If the receptor is bound by a ligand the following phos¬ 
phorylation reaction can happen 

Y + ATP yY*+ ADP, (23) 




FIG. 3: (Color online) Four-state model accounting for the in¬ 
ternal protein Y. The transition rates for the internal protein 
are defined in Eqs. (p3ll and (|24ll. 


where k± are transition rates. If the receptor is empty, 
dephosphorylation may take place, i.e., 

Y*yY + Pi. (24) 


For thermodynamic consistency these transition rates 
must fulfill the generalized detailed balance relation Q 


K-OJ- 


(25) 


where A/i is the free energy liberated in one ATP hydrol¬ 
ysis and we are setting ksT = 1 throughout. This model 
is in equilibrium only if = 0, which means that no 
energy is dissipated. Otherwise, chemical free energy is 
dissipated, with the rate of dissipated heat being charac¬ 
terized by the entropy production Q. 

If we are interested in the occupation of the receptor, 
we obtain the two-state model from Fig. lUby integrating 
Y out since the receptor is independent of the internal 
protein. Hence, the inference time associated with the 
time the receptor is bound in this four-state model is 
as given by Eq. (ED and the one associated with the 
activity of binding/unbinding transitions is as given by 
Eq. dill). 

The external concentration can also be estimated by 
monitoring random variables related to the internal pro¬ 
tein. We consider three random variables: the time the 
internal protein is in the phosphorylated form, the proba¬ 
bility current and the activity of the transitions involving 
the Y protein. Note that the present four-state model has 
only one probability current, which, for A/r > 0, goes in 
the clockwise direction in Fig. [S] The inference time is 
denoted by for the time spent in the phosphorylated 
state. For random variables of the first class we denote 
TJ the inference time related to the current and Tj the 
one related to the activity. The calculation of these quan¬ 
tities using the expressions from Sec. [iTl and Sec. [TTIl is 
explained in Appendix |Cj 
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D. Inference time and dissipation 


We first compare Tj with ■ In equilibrium, the 
current is zero. Hence, the inference time Tj diverges 
for A/i = 0. It turns out that also for monitoring the 
time spent in the phosphorylated state Y* the inference 
time diverges in equilibrium. This divergence comes 
from the fact that the sensitivity dv^ jdc goes to zero in 
equilibrium, since the probability of being in the cluster 
of states for which the protein is phosphorylated becomes 
independent of c for A/r = 0. 

For the current variable we can obtain the following 
relation between heat dissipation and inference time. In 
m we have obtained a general relation between uncer¬ 
tainty and dissipated free energy that for the present 
model reads 


2Dja/[iv^r]>2, (26) 

where a is the thermodynamic entropy production that 
quantifies the rate of dissipation due to ATP consump¬ 
tion. This entropy production is simply given by cr = 
A/iuJ, where uj is the average current. From the defini¬ 
tion of inference time in Eq. (l20l) and from Eq. (1261) we 
obtain 


r/>2 


/51^\ 
\ 9Inc / 


-2 


^-1 


(27) 


where the term is the sensitivity of the average cur¬ 

rent to changes in c. Prominently, this lower bound on 
inference time decreases with increasing dissipation. This 
expression provides a tradeoff between inference time, 
sensitivity and dissipation. Since relation (1261) is gen¬ 
eral, the tradeoff relation (HZD is valid beyond the specific 
model considered in this section. 

For the following calculations we set the transition 
rates to 


K+ = LO+ = Ty and K- = lo- = Ty^, 

(28) 

where Ty is the time-sale of the reversed transitions in 
Eqs. (1^ and (IMl) . We define the dimensionless param¬ 
eter 


J=Tyln. (29) 

First we take 7 —>■ 0, which corresponds to the internal 
protein reactions being fast. The inference time related 
to the current in this limit is 

Y (c^ + Co)(e^^ + 1) + 4ccoe^'"/2 c+cq 
Tj = -c2(eAM/2 _ 1)2 -( 30 ) 

For large A/r, this inference time reaches its minimal 
value given by the uncertainty obtained with the activ¬ 
ity of the binding/unbinding transitions 7i in Eq. (1^^ . 
In this case, the phosphorylated form of the protein Y* 



FIG. 4: (Color online) Inference times related to the current 
TJ (black solid line) and the time spent in the phosphorylated 
state (red dashed line) as a function of A/i for 7 = 10“^, 
Tb = 1, and c/co = 1/2. 


is strongly correlated with the bound state, while Y is 
correlated with the unbound state. The binding of lig¬ 
and is immediately followed by the transition Y —>■ E*, 
whereas the unbinding of a ligand is followed by E* —>■ E. 
Therefore, monitoring the current becomes equivalent to 
monitoring the activity of the receptor. 

In the limit 7 —>■ 0 with any finite A/r > 0, the time 
spent in the phosphorylated state leads to = T 2 , with 
72 given in Eq. (I^Tl) . While for a large A/t > 0 this re¬ 
sult can be explained by the fact that E* (E) and bound 
(unbound) receptor become strongly correlated it is sur¬ 
prising that for any finite A/r > 0 monitoring the internal 
protein is equivalent to monitoring the receptor. For both 
random variables, current and time-spent in the phospho¬ 
rylated state, the best uncertainty that a variable related 
to the internal protein can reach is bounded by the uncer¬ 
tainty obtained by monitoring a corresponding random 
variable related to the receptor directly. The main differ¬ 
ence between these two random variables in the present 
limit is that the precision related to the time spent in 
the phosphorylated state does not improve by increasing 
A/i. Hence, there is a region for which increasing energy 
dissipation leads to an exponential decrease in 1~J , while 
T 2 remains approximately constant, as shown in Fig. 2] 

Another important limit corresponds to the case where 
the reversed transitions of the internal protein are much 
slower than unbinding of the ligand, i.e., 7 —^ 00 . In 
this limit, where the relevant time-scale for the averaging 
time required for a small uncertainty is Ty ^ r^, the 
inference times related to current and time spent in the 
phosphorylated state are 

Y _ (c -f co)^[4ccoe^'"/^ -b (c^ -f c§)(e^^ -b I)] 

c(c-co) 2 co(e^^/ 2 -l) 2 (e ^/^/2 + l) 

(31) 

and 

Y _ 2(c + co)^[(c^ -b cg)e^^/^ -b cco(e^^ -b 1)] _ 

^ c 2 c 2 ( e ^'^/2 - l ) 2 ( eA /^/2 + 1 ) 

(32) 

respectively. In these expressions we are assuming that 
the prefactors are large compared to TbjTy. Particularly, 
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FIG. 5: (Color online) The ratio [Tj in the ln(c/co) x A/r 
plane for 7 = 100 and rt, = 1. The region in blue corre¬ 
sponds to /TJ < 1 , where monitoring the current leads 
to a smaller inference time compared to monitoring the time 
spent in the phosphorylated state. 


they are faster for a free receptor. The inference time 
with the rates in Eq. ((33ll becomes 

(c + co)[(a-l)22ccg-h7-S])_ 

where B = a[c^ + 2c^co + cc§) -I- (cg + c^cq + 2ccq). For 
large a, TX -f T 2 , with T 2 given by Eq. (ED. This result 
can be explained as follows. For a ^ 1, whenever the 
receptor is bound a large number of internal transitions 
take place while for a unbound receptor this number is 
much smaller. Hence, counting the number of transitions 
of the internal protein, i.e., the activity of the internal 
protein, becomes equivalent to monitoring the time spent 
in the bound state. The surprising feature here is that 
depending on the random variable, it is possible to reach 
the Berg and Purcell limit by monitoring the internal 
protein with no energy dissipation. 


for large A/i both the prefactor in Eq. (1511) and the pref¬ 
actor in Eq. ((5^ decay with In this case, the 

expressions remain valid for Pi/T/- Compar¬ 

ing Eq. ED with Eq. ([32D we see that for c close to 
Co Tj > TX: while for c > cq or c < cq, TJ < TX■ 
Hence for intermediate values of the concentration close 
to Co the inference time related to the current is larger. 
This feature is quantified in Fig. [5l showing that the ratio 
TX/'Tj is smaller than 1 in an intermediate concentra¬ 
tion range. The precise range for which monitoring the 
time spent in the phosphorylated state leads to a smaller 
uncertainty than monitoring the current depends on A/r. 


E. Dissipationless inference 


Finally, we analyze the inference time arising from 
monitoring the activity of the transitions involving 
changes in the internal protein Y. For the transition 
rates set in Eq. (E51) we observe that TX > T/ ■ Fur¬ 
thermore, in the limit of 7 ^ 0 and large A/r, for which 
Y* and bound receptor are strongly correlated, the infer¬ 
ence time TX = Tj = 71, with 7i given by Eq. (1^ . 

The main feature of the inference time TX is that it can 
be finite even in equilibrium. Therefore, while the other 
two variables related to Y, namely, the current and the 
time spent in the phosphorylated state, require dissipa¬ 
tion to infer an external concentration, by monitoring the 
activity it is possible to infer the concentration without 
any dissipation. In order to make this feature transparent 
we set the transition rates as 

K+ = At- = cLTy^ and (jj+ = uj- = r“^, (33) 

which from Eq. (1251) imply Afj, = 0. The factor a > 0 
quantifies how much faster the transitions that change 
Y are when the receptor is bound. If a > 1 internal 
transitions are faster with the receptor bound and if o < 1 


F. Comparison to related work 


Let us compare our results with recent work on two- 
component signaling networks that estimate an external 
concentration [3 HE]- In these works the random 
variable is the fluctuating number of internal proteins 
that are in the phosphorylated (or active) state, which 
can be viewed as memory of the receptor occupancy. As 
demonstrated in 31| , the uncertainty related to this ran¬ 
dom variable diverges in equilibrium, similar to current 
and time spent in the phosphorylated state. A main point 
of the present paper is that the relation between dissipa¬ 
tion and uncertainty depends on which random variable 
we choose to estimate the external concentration, as man¬ 
ifested by the fact that activity allows for dissipationless 
inference. We note that for a different model with in¬ 
ternal proteins directly binding to free receptors the un¬ 
certainty estimated from the number of internal proteins 
can also be finite in equilibrium 


The rate of mutual information in bipartite systems, 
for which the two-component network is an example, has 
been studied in [H, . In these references it has been 

shown that the rate of mutual information between an 
external process, which corresponds to the receptor in 
our model, and an internal process, corresponding to the 
protein Y, can be non-zero even in equilibrium. A pos¬ 
teriori, this result indicates that even in equilibrium the 
inference of an external concentration with finite uncer¬ 
tainty should be possible if an appropriate random vari¬ 
able is monitored. We have shown that the activity is a 
random variable that achieves this dissipationless infer¬ 
ence. 


V. CONCLUSION 


We have developed a general formalism that allows for 
the calculation of the dispersion of the time a stochastic 
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trajectory spends in a cluster of states that can be ap¬ 
plied to arbitrary Markov processes with a finite number 
of states. This method is potentially useful for any ap¬ 
plication where the dispersion of this random variable is 
of interest. Furthermore, our formalism is similar to the 
formalism to calculate the dispersion for another class of 
random variables comprising current and activity, which 
has been pioneered by Koza Q. The expressions in Sec. 
m and Sec. m thus provide a unified recipe to calculate 
the dispersion of these two classes of random variables. 

These expressions have been used in the four-state 
model for a two-component network estimating an exter¬ 
nal ligand concentration in Sec. |TVl With this applica¬ 
tion, we have compared the uncertainties of the external 
concentration that are obtained by monitoring three dif¬ 
ferent random variables: current, activity of the internal 
protein, and the time the internal protein is in the phos- 
phorylated state. The main results obtained with our 
case study are as follows. The best performance (small¬ 
est uncertainty) that a random variable related to the 
internal protein can achieve is limited by random vari¬ 
ables associated with the receptor, which is the first layer 
of the network. This best performance is achieved when 
the internal protein is much faster than the receptor. In 
general, determining which random variable leads to the 
smallest uncertainty depends on parameters like the ex¬ 
ternal concentration and the chemical potential driving 
the phosphorylation cycle. If the internal protein is much 
slower than the receptors, the integration time to obtain a 
reliable estimate from the internal protein must be much 
larger than the typical time of a transition of the pro¬ 
tein, which is much larger than the lifetime of the bound 
state of the receptor. Finally, while current and time 
spent in the phosphorylated state require energy dissipa¬ 
tion for a finite uncertainty, it is possible to estimate the 
external concentration from the activity of the internal 
protein even in equilibrium, which we call dissipationless 
inference. 

Recently, we have obtained a universal bound on the 
energetic cost of the relative uncertainty of a generic 
probability current [HI- As shown here, this bound im¬ 
plies a general bound on the inference time based on a 
current variable. An intriguing question is whether it is 
possible to find such universal relations between energy 
dissipation and uncertainty for other random variables 
like the activity or the time a trajectory spends in a clus¬ 
ter of states. The formalism developed in this paper pro¬ 
vides a tool to investigate the existence of such thermo¬ 
dynamic uncertainty relations that connect energy dissi¬ 
pation with uncertainty. 


Appendix A: Derivation of expressions for the first 
class 


The maximum eigenvalue of the modified generator in 
Eq. (|S]) is denoted A(z). It can be shown that this maxi¬ 


mum eigenvalue is the scaled cumulant generatiM func¬ 
tion associated with the random variable Ai [y, 1^. The 
velocity and dispersion are related to A(z) as 

vi = A' (Al) 

and 

D, = A72, (A2) 

respectively. The lack of explicit z dependence denotes 
evaluation at z = 0. Since A(z) is a root of the polynomial 
in Eq. ®, it follows that 

N 

Y,C^{z)[X{z)r = 0. (A3) 

n—0 

Using the fact that A(0) = 0, we obtain Eqs. (fTUl) and 
m from Eqs. m and (I A 21) by taking the first and 
second derivatives with respect to z in Eq. (IA3I) . respec- 
tively. These calculations can be found in more detail in 
plL flSjl , for example. 


Appendix B: Derivation of expressions for the 
second class 


The time interval t is discretized in L intervals of size 
e, with t = Le. The time evolution of the corresponding 
Markov chain is determined by the discrete-time stochas¬ 
tic matrix in Eq. (HI- We consider a random variable 
Xe which counts in how many of the L segments the sys¬ 
tem was in the cluster of states F. We denote 4’{z) the 
maximum eigenvalue of the matrix in Eq. m- Hence, 
'tp{z) is a root of the polynomial in Eq. (fT6|) . i.e., 

N 

^c„(z)7(z)]" = 0. (BI) 

n—0 

For z = 0, the matrix in Eq. dig becomes the discrete¬ 
time stochastic matrix in Eq. (HI, which implies '0(0) = 
1. By taking first and second derivatives in Eq. dm and 
setting z = 0 we obtain 


V'' 


c' 

A^n—0 

J2n=l 


(B2) 


and 

Eto < + 20' Eti nc'^ + Wf En =2 nin - I)c, 
U — „Ar 

(B3) 

Furthermore this maximum eig envalue is the generating 
function associated with [3^, i.e., 

In0(z) = yln(e^^0. (B4) 

Jj 





FIG. 6 : Networks for which numerical tests have been per¬ 
formed. The numbers represent states and the links repre¬ 
sent transition rates that are not zero, (a) Network with four 
states. Note that for U 114 = W 41 — 0 this network of states 
becomes the same as the network from Fig. [S] (b) Network 
with five states. 



FIG. 7: Dispersion D 2 obtained from Eqs. nil) and mi) 
comp ared to numerical simulations for: (a) Network from Fig. 
6 (a) I where the time spent in state 4 is the random variable 
and the rates are W 12 = 10, W 13 = 2, W 31 = 8 , and wn = 


W21 = W24 = W42 = W 43 = W34 = 0.01. (b) Five states 

network from Fig. 6 (b) where the time spent in the cluster 
F = (1,4) is the random variable and the rates are W43 = 20, 


W34 = 8.001, W31 = 0.1, W13 = 0.005, W12 = 2, W15 = 7, 
W24 = 9.9, W21 = 4, W25 = 57, W51 — 0.02, and W52 = 0.6. 


Hence, in the large L limit, 


V'' = (B5) 

and 

r = (B6) 

L 

The first cumulant of the discrete and continuous vari¬ 
ables fulfill the relation 


second cumulant of Xe by 

{XD - (X|) ^ Axl)-{x,f 

t e—^0 L 


(B 8 ) 


The factor e on the right hand side accounts for the cor¬ 
rect dimension. This relation is equivalent to = 

lim<:^o(a^e)/.b^. Note that in Eqs. (IB7I) and (IB 8 I) the 
average in the left hand side is over continuous-time tra¬ 
jectories, whereas the average in the right hand side is 
over discrete-time trajectories. From Eqs. (|B2I) . (IB5I1 
and (IB7I1 we obtain Eq. (fT71) . From Eqs. (IB3I) . (IB 6 I) and 
(IB8I1 we obtain Eq. dn, with /(e) given by Eq. (IT^ . 

Even though we do not have a rigorous proof for Eq. 
(IB 8 I) we have verified numerically its validity for sev¬ 
eral different networks of states by comparing the dis¬ 
persion obtained from continuous-time numerical simu¬ 
lations with the dispersion obtained from our analytical 
expressions in Eqs. CHI and m- For example, we have 
performed this test for the networks of states in Fig. [ 6 ] 
with the results shown in Fig. [T] 


Appendix C: Calculations for the four-state system 


The four states of the model in Fig. [3] are identified as 
{Y, u) = 1, (Y, b) = 2, {Y*,u) = 3, and (Y*, b) = 4, where 
b (u) denotes bound (unbound) receptor. The transition 
rates are given by W 12 = W 34 = W 21 = W 43 = 

wi 3 = w-, UI 31 = a;+, W 24 = K+, and W 42 = K-. 
This four-state model has only one probability current, 
which is the same in any of the four links in Fig. [31 If 
we take the current from 1 to 2 , the respective modified 
generator in Eq. [5] is 


^j{z) 


/ 

-ri 

ui2ie ^ 

W 3 I 

0 


wi2e^ 

-r2 

0 

W 42 

V 

Wi3 

0 

-rs 

W 43 

0 

W24 

W 34 

-r4 


(Cl) 


where ri = W 12 +W 13 , r2 = W 21 +W 24 , rs = W 31 +W 34 , and 
r 4 = ?ii 42 + W 43 . The subscript J indicates the modified 
generator for the current. In the case of the activity of 
the transitions changing the internal portein state, which 
are transitions involving the pairs 13 and 24, the modified 
generator reads 


C-a{z) 


/ 

-ri 

W 21 

W3ie^ 

0 


W 12 

-r2 

0 

W42e. 

[ 

wise^ 

0 

-^3 

W 43 

0 

W24e^ 

W 34 

-r4 


(C2) 


M = M. (B7) 

This relation comes from the fact that the stationary dis¬ 
tribution of the continuous-time Markov process is the 
the same as the discrete-time Markov chain. The sec¬ 
ond cumulant of the continuous variable is related to the 


The inference times Tj and TX are calculated from the 
coefficients of the characteristic polynomial in Eq. m 
with expressions (nni), (HH), and (1301) . where for T/ the 
modified generator is in Eq. (ICll) and for Tj the modi¬ 
fied generator is in Eq. (|C2I) . In both cases, we can get 
full analytical expressions for arbitrary transition rates, 
however they are too long to be displayed here. 
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For the time spent in the phosphorylated state of the 
protein, the corresponding cluster is F = (3,4). The 
matrix in Eq. dm) then becomes 


]C{z) 


/(1-rie) W 2 ie 

Wi2t (1 - rae) 
wiaee^ 0 

V 0 u;24ee"' 


wsie 

0 

(1 - r3e)e^ 
W34ee"^ 



With the coefficients of the characteristic polynomial as¬ 
sociated with this matrix, as given by Eq. m, the infer¬ 
ence time is obtained from Eqs. (Ull), (UHl), (US)) and 
(1^ . We also get a long analytical expression for in 
terms of the transition rates. 
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